Classification and analysis of a large collection of in vivo bioassay descriptions

Authors

  • Magdalena Zwierzyna
  • John P. Overington
Abstract

Testing potential drug treatments in animal disease models is a decisive step in all preclinical drug discovery programs. Yet, despite the importance of such experiments for translational medicine, there have been relatively few efforts to comprehensively and consistently analyze the data produced by in vivo bioassays. This is partly due to their complexity and the lack of accepted reporting standards: publicly available animal screening data are accessible only in unstructured free-text format, which hinders computational analysis. In this study, we use text mining to extract information from the descriptions of over 100,000 drug screening-related assays in rats and mice. We retrieve our dataset from ChEMBL, an open-source literature-based database focused on preclinical drug discovery. We show that in vivo assay descriptions can be effectively mined for relevant information, including experimental factors that might influence the outcome and reproducibility of animal research: genetic strains, experimental treatments, and phenotypic readouts used in the experiments. We further systematize the extracted information using an unsupervised language model (Word2Vec), which learns semantic similarities between terms and phrases, allowing identification of related animal models and classification of entire assay descriptions. In addition, we show that random forest models trained on features generated by Word2Vec can predict the class of drugs tested in different in vivo assays with high accuracy. Finally, we combine information mined from text with curated annotations stored in ChEMBL to investigate the patterns of usage of different animal models across a range of experiments, drug classes, and disease areas.
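As a rough illustration of how word embeddings can be turned into features for whole assay descriptions, the sketch below averages per-token vectors and compares descriptions by cosine similarity. It is a minimal sketch, not the authors' pipeline: the three-dimensional vectors in `TOY_VECTORS` are made-up placeholders standing in for a trained Word2Vec model, the example descriptions are invented, and averaging is only one assumed way of pooling token vectors.

```python
import math
import re

# Hypothetical 3-dimensional embeddings (illustrative values only);
# a trained Word2Vec model would supply real, higher-dimensional vectors.
TOY_VECTORS = {
    "carrageenan": [0.90, 0.10, 0.00],
    "paw": [0.80, 0.20, 0.10],
    "edema": [0.85, 0.15, 0.05],
    "glucose": [0.10, 0.90, 0.00],
    "diabetic": [0.05, 0.95, 0.10],
    "streptozotocin": [0.10, 0.85, 0.05],
}


def embed_description(text, vectors=TOY_VECTORS):
    """Average the vectors of all known tokens; return None if none are known."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    known = [vectors[t] for t in tokens if t in vectors]
    if not known:
        return None
    dim = len(known[0])
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


# Invented assay descriptions, used only to exercise the functions.
edema_rat = embed_description("Inhibition of carrageenan-induced paw edema in rats")
edema_mouse = embed_description("Reduction of paw edema in mice")
diabetes_rat = embed_description(
    "Blood glucose lowering in streptozotocin-induced diabetic rats"
)

sim_related = cosine(edema_rat, edema_mouse)
sim_unrelated = cosine(edema_rat, diabetes_rat)
```

With these placeholder vectors, the two edema descriptions end up closer to each other than to the diabetes description. The same pooled vectors could serve as input features for a classifier such as a random forest.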




Journal title:

Volume 13, Issue 

Pages -

Publication date: 2017